Multi-relational Bayesian Classification through Genetic Approach

نویسندگان

  • Nitin Kumar Choudhary
  • Gaurav Shrivastava
  • Mahesh Malviya
چکیده

Classification is an important subject in data mining and machine learning, which has been studied extensively and has a wide range of applications. Classification based on association rules, also called associative classification, is a technique that uses association rules to build classifier. CMAR employs a novel data structure, association rule, to compactly store and efficiently retrieve a large number of rules for classification. Association rule is a prefix rule structure to explore the sharing among rules, which achieves substantial compactness. To speed up the mining of complete set of rules, CMAR adopts a variant of recently developed FPgrowth method. FP-growth is much faster than Apriori-like methods used in previous association-based classification, such as especially when there exist a huge number of rules, large training data sets, and long pattern rules. We use classification using association rules not only to solve classification problems, but also to compare the quality of different association rule mining approaches. In this context we show that the quality of rule sets from the standard algorithm for association rule mining can be improved by using a different association rule mining strategy Above classification rate is 80%( MAX) hence the 20% data are unclassified. This is a challenge in the field of data classification. In this paper, we used multiple relational Bayesian classification algorithm based on genetic algorithm used for optimization of classification rate, generated by association rule.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Entropy Based Feature Selection For Multi-Relational Naïve Bayesian Classifier

Current industries data’s are stored in relation structures. In usual approach to mine these data, we often use to join several relations to form a single relation using foreign key links, which is known as flatten. Flatten may cause troubles such as time consuming, data redundancy and statistical skew on data. Hence, the critical issues arise that how to mine data directly on numerous relation...

متن کامل

Multi-objective optimization in WEDM of D3 tool steel using integrated approach of Taguchi method & Grey relational analysis

In this paper, wire electrical discharge machining of D3 tool steel is studied. Influence of pulse-on time, pulse-off time, peak current and wire speed are investigated for MRR, dimensional deviation, gap current and machining time, during intricate machining of D3 tool steel. Taguchi method is used for single characteristics optimization and to optimize all four process parameters simultaneous...

متن کامل

Model Selection Scores for Multi - Relational Bayesian Networks ∗

Many organizations maintain their data in a relational database, which contains information about entities, their attributes, relationships among the entities, and attributes of the relationships. Statistical-relational learning (SRL) aims to generalize traditional single-table machine learning methods for multi-relational data. Many SRL models are defined using a combination of graphs and firs...

متن کامل

Approaching the ILP 2005 Challenge: Class-Conditional Bayesian Propositionalization for Genetic Classification

This report presents a statistical propositionalisation approach to relational classification and probability estimation on the genetic ILP Challenge domain. The main difference between our and existing propositionalisation approaches is its ability to construct features from categorical attributes with many possible values and in particular the object identifiers. Our classification and rankin...

متن کامل

An Efficient Multi-relational Naïve Bayesian Classifier Based on Semantic Relationship Graph

Classification is one of the most popular data mining tasks with a wide range of applications, and lots of algorithms have been proposed to build accurate and scalable classifiers. Most of these algorithms only take a single table as input, whereas in the real world most data are stored in multiple tables and managed by relational database systems. As transferring data from multiple tables into...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012